AITopics

2605.17238

Country: North America > United States (0.67)

Genre: Research Report (0.40)

Industry:

Information Technology > Services (0.67)
Consumer Products & Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.34)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.34)

Neural Information Processing Systems, Feb-11-2026, 23:06:21 GMT

Machine Learning Estimation of Heterogeneous Treatment Effects with Instruments

Vasilis Syrgkanis, Victor Lei, Miruna Oprescu, Maggie Hei, Keith Battocchi, Greg Lewis

Neural Information Processing Systems http://nips.cc/

instrument, projection, treatment effect, (14 more...)

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States (0.04)
North America > Canada (0.04)

Genre:

Research Report > Experimental Study (0.69)
Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)

Neural Information Processing Systems, Dec-25-2025, 06:32:22 GMT

Machine Learning Estimation of Heterogeneous Treatment Effects with Instruments

We consider the estimation of heterogeneous treatment effects with arbitrary machine learning methods in the presence of unobserved confounders with the aid of a valid instrument. Such settings arise in A/B tests with an intent-to-treat structure, where the experimenter randomizes over which user will receive a recommendation to take an action, and we are interested in the effect of the downstream action. We develop a statistical learning approach to the estimation of heterogeneous effects, reducing the problem to the minimization of an appropriate loss function that depends on a set of auxiliary models (each corresponding to a separate prediction task). The reduction enables the use of all recent algorithmic advances (e.g.

heterogeneous treatment effect, machine learning estimation, name change, (4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)
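The intent-to-treat setting described in the abstract above can be illustrated with a small simulation. This is a minimal sketch, not the paper's loss-minimization method: it swaps in the classical Wald (ratio) instrumental-variable estimator, applied separately within two covariate subgroups, to show why a randomized recommendation helps when an unobserved confounder is present. The data-generating process, the function names `simulate` and `wald_estimate`, and all coefficients are assumptions made for illustration.

```python
import numpy as np


def wald_estimate(y, t, z):
    """Wald (ratio) IV estimate of the effect of t on y, using a binary instrument z."""
    # Intent-to-treat effect of the recommendation on the outcome.
    itt_outcome = y[z == 1].mean() - y[z == 0].mean()
    # Effect of the recommendation on taking the action (compliance).
    itt_treatment = t[z == 1].mean() - t[z == 0].mean()
    return itt_outcome / itt_treatment


def simulate(n, seed=0):
    """Hypothetical A/B test: z is a randomized recommendation, t the downstream action."""
    rng = np.random.default_rng(seed)
    x = rng.integers(0, 2, size=n)   # observed binary covariate (user segment)
    u = rng.normal(size=n)           # unobserved confounder
    z = rng.integers(0, 2, size=n)   # randomized recommendation (instrument)
    # Users choose the action based on the recommendation and the confounder.
    t = (u + 1.5 * z + rng.normal(size=n) > 0.75).astype(float)
    theta = 1.0 + 2.0 * x            # true effect: 1.0 if x == 0, 3.0 if x == 1
    y = theta * t + 2.0 * u + rng.normal(size=n)
    return x, z, t, y


x, z, t, y = simulate(400_000)

iv_estimates = {}
naive_estimates = {}
for g in (0, 1):
    m = x == g
    iv_estimates[g] = wald_estimate(y[m], t[m], z[m])
    # Naive comparison of takers vs. non-takers, biased by the confounder u.
    naive_estimates[g] = y[m & (t == 1)].mean() - y[m & (t == 0)].mean()

print("IV (Wald) estimates by segment:", iv_estimates)
print("Naive estimates by segment:   ", naive_estimates)
```

In this sketch the subgroup Wald estimates should land close to the true effects of 1.0 and 3.0, while the naive comparison is pushed upward by the confounder. A per-subgroup ratio only works for a small number of discrete segments; handling rich covariates is the kind of problem the paper's reduction to loss minimization is meant to address.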

Neural Information Processing Systems, Oct-9-2025, 16:39:38 GMT

Supplement to "Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models"

Supplement to "Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models" See [11, 42] for more detailed discussions. Consider a supervised learning problem, where we have N subjects. Finally, these three models are all special cases of the following hierarchical model (a.k.a. The aforementioned statistical concepts are typically introduced for supervised learning. It is easy to see that this model is a random effects model.

artificial intelligence, bandit, machine learning, (16 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.46)

Neural Information Processing Systems, Oct-9-2025, 13:48:10 GMT

Machine Learning Estimation of Heterogeneous Treatment Effects with Instruments

Vasilis Syrgkanis, Victor Lei, Miruna Oprescu, Maggie Hei, Keith Battocchi, Greg Lewis

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, treatment effect, (16 more...)

Country: North America (0.46)

Genre:

Research Report > Experimental Study (0.69)
Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)

Salvadé, Nicolas, Hillel, Tim

Functional effects models: Accounting for preference heterogeneity in panel data with machine learning

arXiv.org Machine Learning, Sep-23-2025

In this paper, we present a general specification for Functional Effects Models, which use Machine Learning (ML) methodologies to learn individual-specific preference parameters from socio-demographic characteristics, therefore accounting for inter-individual heterogeneity in panel choice data. We identify three specific advantages of the Functional Effects Model over traditional fixed and random/mixed effects models: (i) by mapping individual-specific effects as a function of socio-demographic variables, we can account for these effects when forecasting choices of previously unobserved individuals; (ii) the (approximate) maximum-likelihood estimation of functional effects avoids the incidental parameters problem of the fixed effects model, even when the number of observed choices per individual is small; and (iii) we do not rely on the strong distributional assumptions of the random effects model, which may not match reality. We learn functional intercepts and functional slopes with powerful non-linear machine learning regressors for tabular data, namely gradient boosting decision trees and deep neural networks. We validate our proposed methodology on a synthetic experiment and three real-world panel case studies, demonstrating that the Functional Effects Model: (i) can identify the true values of individual-specific effects when the data generation process is known; (ii) outperforms both state-of-the-art ML choice modelling techniques that omit individual heterogeneity in terms of predictive performance, as well as traditional static panel choice models in terms of learning inter-individual heterogeneity. The results indicate that the FI-RUMBoost model, which combines the individual-specific constants of the Functional Effects Model with the complex, non-linear utilities of RUMBoost, performs marginally best on large-scale revealed preference panel data.

effect model, functional effect, intercept, (14 more...)

2509.18047

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry:

Transportation (0.93)
Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
(2 more...)

Tang, Tiffany M., Levina, Elizaveta, Zhu, Ji

Interpretable Network-assisted Random Forest+

arXiv.org Machine Learning, Sep-22-2025

Machine learning algorithms often assume that training samples are independent. When data points are connected by a network, the induced dependency between samples is both a challenge, reducing effective sample size, and an opportunity to improve prediction by leveraging information from network neighbors. Multiple methods taking advantage of this opportunity are now available, but many, including graph neural networks, are not easily interpretable, limiting their usefulness for understanding how a model makes its predictions. Others, such as network-assisted linear regression, are interpretable but often yield substantially worse prediction performance. We bridge this gap by proposing a family of flexible network-assisted models built upon a generalization of random forests (RF+), which achieves highly-competitive prediction accuracy and can be interpreted through feature importance measures. In particular, we develop a suite of interpretation tools that enable practitioners to not only identify important features that drive model predictions, but also quantify the importance of the network contribution to prediction. Importantly, we provide both global and local importance measures as well as sample influence measures to assess the impact of a given observation. This suite of tools broadens the scope and applicability of network-assisted machine learning for high-impact problems where interpretability and transparency are essential.

feature importance, penalty, prediction, (15 more...)

2509.15611

Country:

North America > United States > Michigan (0.04)
Asia > Singapore (0.04)

Genre: Research Report > New Finding (0.67)

Industry:

Health & Medicine (0.93)
Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.49)

Wei, Changshuai, Elston, Robert C., Lu, Qing

A weighted U statistic for association analysis considering genetic heterogeneity

arXiv.org Artificial Intelligence, Aug-18-2025

Converging evidence suggests that common complex diseases with the same or similar clinical manifestations could have different underlying genetic etiologies. While current research interests have shifted toward uncovering rare variants and structural variations predisposing to human diseases, the impact of heterogeneity in genetic studies of complex diseases has been largely overlooked. Most of the existing statistical methods assume the disease under investigation has a homogeneous genetic effect and could, therefore, have low power if the disease undergoes heterogeneous pathophysiological and etiological processes. In this paper, we propose a heterogeneity weighted U (HWU) method for association analyses considering genetic heterogeneity. HWU can be applied to various types of phenotypes (e.g., binary and continuous) and is computationally efficient for high-dimensional genetic data. Through simulations, we showed the advantage of HWU when the underlying genetic etiology of a disease was heterogeneous, as well as the robustness of HWU against different model assumptions (e.g., phenotype distributions). Using HWU, we conducted a genome-wide analysis of nicotine dependence from the Study of Addiction: Genetics and Environments (SAGE) dataset. The genome-wide analysis of nearly one million genetic markers took 7 hours, identifying heterogeneous effects of two new genes (i.e., CYP3A5 and IKBKB) on nicotine dependence.

artificial intelligence, genetic heterogeneity, machine learning, (16 more...)

doi: 10.1002/sim.6877

1504.08319

Country: North America > United States > Texas (0.14)

Genre: Research Report > Experimental Study (0.94)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Addiction Disorder (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Murakami, Ryo, Miura, Seiji, Endo, Akihiro, Minamoto, Satoshi

Sparse mixed linear modeling with anchor-based guidance for high-entropy alloy discovery

arXiv.org Machine Learning, Apr-30-2025

Regular article

Ryo Murakami (a), Seiji Miura (b), Akihiro Endo (a) and Satoshi Minamoto (a)

(a) Materials Data Platform, Research Network and Facility Services Division, National Institute for Materials Science, Tsukuba 305-0044, Ibaraki, Japan
(b) Division of Materials Science and Engineering, Faculty of Engineering, Hokkaido University, Sapporo 060-8628, Hokkaido, Japan

Article history: Compiled April 30, 2025

Abstract: High-entropy alloys have attracted attention for their exceptional mechanical properties and thermal stability. To solve this problem, machine learning techniques have been increasingly employed for property prediction and high-throughput screening. Nevertheless, highly accurate nonlinear models often suffer from a lack of interpretability, which is a major limitation. In this study, we focus on local data structures that emerge from the greedy search behavior inherent to experimental data acquisition. By introducing a linear and low-dimensional mixture regression model, we strike a balance between predictive performance and model interpretability. In addition, we develop an algorithm that simultaneously performs prediction and feature selection by considering multiple candidate descriptors. Through a case study on high-entropy alloys, this study introduces a method that combines anchor-guided clustering and sparse linear modeling to address biased data structures arising from greedy exploration in materials science.

Keywords: Sparse modeling; Mixed linear model; Bayesian inference; Materials informatics; Data-driven science; High-entropy alloys

1. Introduction

In recent years, high-entropy alloys (HEAs) have garnered attention as next-generation materials for their outstanding mechanical properties, thermal stability, and corrosion resistance [1,2]. Unlike conventional alloy designs, HEAs (also referred to as multi-principal element alloys) comprise multiple (typically five or more) principal elements, offering a high degree of chemical and structural freedom. This unique composition enables the exploration of novel properties unattainable in traditional materials systems.

alloy, artificial intelligence, machine learning, (20 more...)

2504.20354

Country:

Asia > Japan > Honshū > Kantō > Ibaraki Prefecture > Tsukuba (0.24)
Asia > Japan > Hokkaidō > Hokkaidō Prefecture > Sapporo (0.24)

Genre: Research Report > New Finding (0.51)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)